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Abstract. We study decompositions of the global NVALUE constraint. Our main 
contribution is theoretical: we show that there are propagators for global con- 
straints like NValue which decomposition can simulate with the same time com- 
plexity but with a much greater space complexity. This suggests that the benefit 
of a global propagator may often not be in saving time but in saving space. Our 
■^r , other theoretical contribution is to show for the first time that range consistency 

^ ' can be enforced on NVALUE with the same worst-case time complexity as bound 

^ ^ consistency. Finally, the decompositions we study are readily encoded as linear 

inequalities. We are therefore able to use them in integer linear programs. 

^ ■ 1 Introduction 

O' 

^O ' Global constraints are one of the distinguishing features of constraint programming. 

They capture common modelling patterns and have associated efficient propagators for 
t~^ ' pruning the search space. For example, All-Different is one of the best known 

^D I global constraints that has proven useful in the modelling and solving of many real 

world problems. A number of efficient algorithms have been proposed to propa- 
gate the All-Different constraint (e.g. [1-3]). Whilst there is Httle debate that 
All-Different is a global constraint, the formal definition of a global constraint 
is more difficult to pin down. One property often associated with global constraints is 
^^ • that they cannot be decomposed into simpler constraints without impacting either the 

5^ \ pruning or the efficiency of propagation [4]. Recently progress has been made on the 

theoretical problem of understanding what is and isn't a global constraint. In particular, 
whilst a bound consistency propagator for the All-Different constraint can be ef- 
fectively simulated with a simple decomposition [5], circuit complexity lower bounds 
have been used to prove that a domain consistency propagator for All-Different 
cannot be polynomially simulated by a simple decomposition [6]. 

In this paper, we turn to a strict generalization of the All-Different constraint. 
NValue counts the number of values used by a set of variables; the All-Different 
constraint ensures that this count equals the cardinality of the set. From a theoretical 
perspective, the NValue constraint is significantly more difficult to propagate than the 
All-Different constraint since enforcing domain consistency is known to be NP- 
hard [7]. Moreover, as NValue is a generaUzation of All-Different, there exists no 



polynomial sized decomposition of N VALUE which achieves domain consistency [6]. 
Nevertheless, we show that decomposition can simulate the polynomial time algorithm 
for enforcing bound consistency on NValue but with a significant space complexity. 
We also prove, for the first time, that range consistency on NVALUE can be enforced 
in the same worst case time complexity as bound consistency. This contrasts with the 
All-Different constraint where range consistency takes 0{n^) time [2] but bound 
consistency takes just O(nlogn) time [3]. 

The main value of these decompositions is theoretical as their space complexity is 
equal to their worst case time complexity. When domains are large, this space complex- 
ity may be prohibitive. In the conclusion, we argue why it appears somewhat inevitable 
that the space complexity is equal to the worst case time complexity. These results sug- 
gest new insight into what is and isn't a global constraint: a global constraint either 
provides more pruning than any polynomial sized decomposition or provides the same 
pruning but with lower space complexity. There are several other theoretical reasons 
why the decompositions studied here are interesting. First, it is technically interest- 
ing that a complex propagation algorithm like the bound consistency propagator for 
NValue can be simulated by a simple decomposition. Second, these decompositions 
can be readily encoded as linear inequalities and used in linear programs. In fact, we 
will report experiments using both constraint and integer linear programming with these 
decompositions. Since global constraints are one of the key differentiators between con- 
straint and integer programming, these decompositions provide us with another tool to 
explore the interface between constraint and integer programming. Third, the decompo- 
sitions give insights into how we might add nogood learning to a NVALUE propagator. 

2 Background 

A constraint satisfaction problem (CSP) consists of a set of variables, each with a fi- 
nite domain of values, and a set of constraints. We use capitals for variables and lower 
case for values. We assume values are taken from the set 1 to d. We write dora{Xi) 
for the domain of possible values for Xi, m,in{Xi) for the smallest value in dom{Xi), 
max{Xi) for the greatest, and range{Xi) for the interval [Tnin{Xi), max{Xi)]. Con- 
straint solvers typically use backtracking search to explore the space of partial assign- 
ments. After each assignment, propagation algorithms prune the search space by en- 
forcing local consistency properties like domain, range or bound consistency. A con- 
straint is domain consistent (DC) iff when a variable is assigned any of the values in 
its domain, there exist compatible values in the domains of all the other variables of 
the constraint. Such an assignment is called a support. A CSP is domain consistent iff 
every constraint is domain consistent. A constraint is disentailed iff there is no possi- 
ble support. A propagator which enforces domain consistency will detect disentailment, 
but a propagator that detects just disentailment will not enforce domain consistency. A 
constraint is range consistent (RC) iff, when a variable is assigned any of the values in 
its domain, there exist compatible values between the minimum and maximum domain 
value for all the other variables of the constraint. Such an assignment is called a bound 
support. A constraint is bound consistent (BC) iff the minimum and maximum value 
of every variable of the constraint belong to a bound support. A CSP is bound con- 



sistent iff every constraint is bound consistent. We compute the total amortized cost of 
enforcing a local consistency down an entire branch of the search tree. This captures the 
incremental cost of propagation. Finally, we will assume that a propagator is invoked 
at most once for each domain change and that the solver uses an optimal propagator 
to enforce BC on sum and channeling constraints. Such assumptions hold for modern 
solvers like Gecode and Hog Solver However, we make no assumption about the order 
of invocation of the constraints in a decomposition. The upper bounds we give hold 
regardless of the order in which constraints are processed. 

A global constraint is one in which the arity of the constraint n is a parameter. A 
decomposition of a global constraint is a CSP involving the n variables of the global 
constraint (and possibly others), involving only constraints with fixed arity (no global 
constraint) or constraints that are themselves decomposable, such that the size of the 
CSP is polynomial in the sum of the sizes of the domains of the n original variables, and 
such that the projection of its solutions on those n variables corresponds to the solutions 
of the global constraint. A useful notion is algorithmic globality [4]. Informally, given 
a local consistency property, a global constraint is algorithmically global if there is no 
decomposition on which this local consistency is achieved in the same time and space 
complexity. We suggest here two refinements of this notion of algorithmic globality. 
First, we will separate the space and time complexity. That is, given a local consistency 
property, a global constraint is algorithmically global with respect to time (space) if 
there is no decomposition on which this local consistency is achieved in the same time 
(space) complexity. Second, unlike [4], we consider decompositions that may introduce 
new variables. Our results will show that, when we introduce new variables, N VALUE 
is not algorithmically global with respect to time but is global with respect to space. 

3 NValue constraint 

Pachet and Roy first proposed the N Value constraint [8]. Formally 
NVALVE{[Xi,...,Xn],N) ensures that N = \{Xi | 1 < i < n}\. This gener- 
alizes several other global constraints including All-Different (which ensures 
that the number of values taken by a set of variables equals the cardinality of the set) 
and Not- All-Equal (which ensures a set of variables take more than one value). 
Enforcing domain consistency on the N Value constraint is NP-hard (Theorem 3 in 
[7]) even when N is fixed (Theorem 2 in [9]). In fact, just computing the lower bound 
on N is NP-hard (Theorem 3 in [10]). In addition, enforcing domain consistency on 
the NValue constraint is not fixed parameter tractable since it is VF [2] -complete [11]. 
However, several polynomial propagation algorithms have been proposed that achieve 
bound consistency and some closely related levels of local consistency [12, 9, 13]. 

3.1 Simple decomposition 

Global constraints can often be decomposed into simpler, more primitive and small ar- 
ity constraints. For example, the All-Different constraint can be decomposed into 
a quadratic number of binary inequalities. However, such decomposition often hinders 
propagation and can have a significant impact on the solver's ability to find solutions 



[14]. We can decompose the N VALUE constraint by introducing 0/1 variables to repre- 
sent which values are used and posting a sum constraint on these introduced variables: 



X, = J -^B, = l 


yi <i<n,l<j <d 
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Note that constraint 3 is not a fixed arity constraint, but can itself be decomposed to 
ternary sums without hindering bound propagation. Unfortunately, this simple decom- 
position hinders propagation. It can be BC whereas BC on the corresponding NValue 
constraint detects disentailment. 

Theorem 1 BC on NValue is stronger than BC on its decomposition into (1) to (3). 

Proof: Clearly BC on NValue is at least as strong as BC on the decomposition. To 
show strictness, consider Xi G {1, 2}, X2 G {3, 4}, Bj G {0, 1} for I < j < 4, and 
A^ = 1. Constraints (1) to (3) are BC. However, the corresponding NVALUE constraint 
has no bound support and thus enforcing BC on it detects disentailment. D 

We observe that enforcing DC instead of BC on constraints (1) to (3) in the example 
of the proof above still does not prune any value. To decompose NVALUE without 
hindering propagation, we must look to more complex decompositions. 

3.2 Decomposition into AtMostNValue and AtLeastNValue 

Our first step in decomposing the NVALUE constraint is to split it into 
two parts: an AtMostNValue and an AtLeastNValue constraint. 

ATLEASTNVALUE([Xi,...,X„],iV) holds iff iV < \{X^\1 < i < n}| whilst 
ATMoSTNVALUE([Xi,...,X„],A^)holdsiff |{X,|1 <i<n}\<N. 

Running Example. Consider a NVALUE constraint over the following variables and 
values: 

12 3 4 5 



X2 
X3 
Xi 

X5 

N 

Suppose we decompose this into an AtMostNValue and an AtLeaSTN VALUE con- 
straint. Consider the AtLeastNValue constraint. The 5 variables can take at most 
4 different values because X2, X^, X^, and X^ can only take values 2, 3 and 4. Hence, 
there is no bound support for N — 5. Enforcing BC on the AtLeastNValue con- 
straint therefore prunes N = b. Consider now the AtMostN VALUE constraint. Since 
X2 and X^ guarantee that we take at least 2 different values, there is no bound support 
for N ^ I. Hence enforcing BC on an AtMOSTN VALUE constraint prunes N ~ 1. If 
Xi = 1, 3 or 5, or X^ = 3 then any complete assignment uses at least 3 different val- 
ues. Hence there is also no bound support for these assignments. Pruning these values 



gives bound consistent domains for the original N VALUE constraint: 
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To show that decomposing the N Value constraint into these two parts does not 
hinder propagation in general, we will use the following lemma. Given an assignment 
S of values, card{S) denotes the number of distinct values in S. Given a vector of 
variables X = Xi . . .Xn, cardi^{X) = max{card{S) \ S G IIxiexrange{Xi)} and 
card^lX) = min{card(S) \ S G nXiex'''cmge{Xi)}. 

Lemma 1 (adapted from [13]) Consider NValue([Xi, . . . , X„], TV). If dom{N) C 
[cardi{X) ^ card^{X)\, then the bounds of N have bound supports. 

Proof: Let Smin be an assignment of X in 11 Xi(^x'range{Xi) with card{Smin) = 
cardi{X) and Smax be an assignment of X in IIxi£xrange{Xi) with card{Smax) = 
card^{X). Consider the sequence Smin — S'o, -S*!, . . . , 5„ = Smax where Su+i is the 
same as Sk except that Xk+i has been assigned its value in Smax instead of its value in 
Smin- \card{Sk+i) — card{Sk)\ < 1 because they only differ on Xk+i- Hence, for any 
p G [cardi{X), card^{X)], there exists k G l..n with card{Sk) — P- Thus, {Sk,p) is a 
bound support for p on NValue( [Xi , . . . , Xn] , N). Therefore, ■min{N) and max{N) 
have a bound support. D 

We now prove that decomposing the NValue constraint into AtMostNValue 
and AtLeastN Value constraints does not hinder pruning when enforcing BC. 

Theorem 2 BC on NValue([Xi, . . . ,X„], A^) is equivalent to BC on 
AtMostNValue([Xi, . . . ,Xn],N) and on AtLeastNValue([Xi, . . . ,Xn],N). 

Proof: Suppose the AtMostNValue and AtLeastNValue constraints are BC. 
The AtMostNValue constraint guarantees that cardi{X) < min{N) and the 
AtLeastNValue constraint guarantees that card^(X) > ■max{N). Therefore, 
dom{N) G [cardi{X),card^{X)]. By Lemma 1, the variable N is bound consistent. 

Consider a variable/bound value pair Xi =- b. Let (<5'fg£,sj,pi) be a bound sup- 
port of Xi = b in the AtLeastNValue constraint and (S'^jost'-Pa) be a bound sup- 
port of Xi ^ b in the AtMostNValue constraint. We have card{Si^^^^) > pi 
and card{Smost) < P2 by definition of AtLeastNValue and AtMostNValue. 
Consider the sequence Si^^^^ — Sq,S\,. . . ,Sn = Smost where S'^_|_;^ is the same 
as S^ except that Xk+i has been assigned its value in S^^^^f instead of its value in 
'^leasf \^^''^'^i^k+i) ~ ^^'''^{^X)\ — 1 because they only differ on Xk+i- Hence, 
there exists k G l..n with min{pi,p2) < card{S\) < max{pi,p2)- We know 
that pi and p2 belong to range{N) because they belong to bound supports. Thus, 
card{S\) G range{N) and {S\Tcard{S\)) is a bound support for Xi ~ 6 on 
N Value ([Xi,...,X„],iV). □ 

When enforcing domain consistency, Bessiere et al. [13] noted that decompos- 
ing the NValue constraint into AtMostNValue and AtLeastNValue constraints 



does hinder propagation, but only when dom{N) contains just card^ {X) and card^ {X) 
and there is a gap in the domain in-between (see Theorem 1 in [13] and the discussion 
that follows). When enforcing BC, any such gap in the domain for N is ignored. 

4 AtMostNValue constraint 

We now give a decomposition for the AtMostNValue constraint which does not hin- 
der bound consistency propagation. To decompose the AtMostNValue constraint, 
we introduce 0/1 variables, Au^ to represent whether Xi uses a value in the interval 
[Itu], and "pyramid" variables, Miu with domains [0, min (u — I -\-l,n)] which count 
the number of values taken inside the interval [l^u]. To constrain these introduced vari- 
ables, we post the following constraints: 

Aiiu = 1 <=^ X^ e[l,u\ yi<i<n,l<l<u<d (4) 

Auu <Miu yi<i<n,l<l<u<d (5) 

Miu = Mik + M(fe+i)„ \fl<k<u<d (6) 

Mid < N (7) 

Running Example. Consider the decomposition of an AtMostNValue constraint 
over the following variables and values: 





12 3 4 5 
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X2 
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X:, 
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Xi 


* 


Xn 


* * 


N 


* * 



Observe that we consider that value 5 for N has already been pruned by 
AtLeastNValue, as will be shown in next sections. Bound consistency reasoning 
on the decomposition will make the following inferences. As X2 = 2, from (4) we 
get A222 = 1- Hence by (5), M22 = 1- Similarly, as X4 — 4, we get A444 = 1 
and M44 = 1. Now N € {1,2}. By (7) and (6), M15 < N, M15 = Mu + M55, 
Mi4 = Mi3 + M44, Mi3 = M12 + M33, M12 = Mil + M22. Since M22 = M44 = 1, 
we deduce that N > 1 and hence N = 2. This gives Mu = M33 = M55 = 0. By (5), 
Am — A133 ~ j4i55 = A533 = 0. Finally, from (4), we get Xi = 2 and X^^ — 3. This 
gives us bound consistent domains for the AtMostN VALUE constraint. 

We now prove that this decomposition does not hinder propagation in general. 

Theorem 3 BC on constraints (4) to (7) is equivalent to BC on AtMostNValue 
{[Xi, . . . , Xn], N), and takes 0{nd'^) time to enforce down the branch of the search 
tree. 

Proof: First note that changing the domains of the X variables cannot affect the upper 
bound of N by the AtMostNValue constraint and, conversely, changing the lower 
bound of N cannot affect the domains of the X variables. 



Let Y — {Xp-^ , . . . , Xp^ } be a maximum cardinality subset of variables of X whose 
ranges are pairwise disjoint (i.e., range{Xp.) D range{Xp.) — 0, Vi, j £ l..fc, i ^ j). 
Let ly — {\bi,Ci] I hi — min{Xp^), Ci = ■max{Xp^),Xp. G Y} be the corresponding 
ordered set of disjoint ranges of the variables in Y. It has been shown in [9] that \Y\ = 
cardiiX). 

Consider the interval [bi, Ci] G ly- Constraints (5) ensure that the variables Mi,.^ 
i — [1, . . . , fc] are greater than or equal to 1 and constraints (6) ensure that the vari- 
able Mirf is greater than or equal to the sum of lower bounds of variables Mi,. a, 
i = [1, . . . , fc], because intervals [bi, Ci] are disjoint. Therefore, the variable N is greater 
than or equal to cardi{X) and it is bound consistent. 

We show that when N is BC and dom[N) ^ {cardi{Xy}, all X variables are BC. 
Take any assignment S <E nXiexf'ange{Xi) such that card{S) — cardi{X). Let 
S[Xi <r- b] be the assignment S where the value of Xi in S has been replaced by b, one 
of the bounds of Xi. We know that card{S[Xi ^ b]) e [card{S) - 1, card{S) + 1] = 
[cardi{X) — 1, cardi{X) + 1] because only one variable has been flipped. Hence, any 
assignment {S,p) with p > cardi{X) + 1 is a bound support. dom{N) necessarily 
contains such a value p by assumption. 

The only case when pruning might occur is if the variable N is ground and 
card^{X) — N . Constraints (6) imply that Mid equals the sum of variables Mi^bj_i + 
Mh^^ct+Mc^+i^h2-i ■ . . + Mb„^c„ +A^c„+i,d- The lower bound of the variable Mc,,b. 
is greater than one and there are |y| — card^{X) — A^of these intervals. Therefore, by 
constraint (7), the upper bound of variables Mc._j+i,(,j-i that correspond to intervals 
outside the set ly are forced to zero. 

There are 0{nd^) constraints (4) and constraints (5) that can be woken 0{d) times 
down the branch of the search tree. Each requires 0(1) time for a total of 0{nd^ ) down 
the branch. There are 0{d^) constraints (6) which can be woken 0{n) times down the 
branch and each invocation takes 0(1) time. This gives a total of 0{nd^). The final 
complexity down the branch of the search tree is therefore 0{nd^). D 

The proof of theorem 3 also provides the corollary that enforcing range on con- 
sistency on constraints 4 enforces range consistency on AtMostNValue. Note that 
theorem 3 shows that the BC propagator of AtMostNValue [12] is not algorithmi- 
cally global with respect to time, as BC can be achieved with a decomposition with 
comparable time complexity. On the other hand, the 0{nd^) space complexity of this 
decomposition suggests that it is algorithmically global with respect to space. Of course, 
we only provide upper bounds here, so it may be that AtMostNValue is not algo- 
rithmically global with respect to either time or space. 

5 Faster decompositions 

We can improve how the solver handles this decomposition of the AtMostNValue 
constraint by adding implied constraints and by implementing specialized propagators. 
Our first improvement is to add an implied constraint and enforce BC on it: 

d 

Mid = Y. ^" (8) 

1=1 



This does not change the asymptotic complexity of reasoning with the decomposition, 
nor does it improve the level of propagation achieved. However, we have found that the 
fixed point of propagation is reached quicker in practice with such an implied constraint. 
Our second improvement decreases the asymptotic complexity of enforcing BC on 
the decomposition of Section 4. The complexity is dominated by reasoning with con- 
straints (4) which channel from Xi to Au^ and thence onto M;„ (through constraints 
(5)). If constraints (4) are not woken uselessly, enforcing BC costs 0(1) per constraint 
down the branch. Unfortunately, existing solvers wake up such constraints as soon as a 
bound is modified, thus giving a cost in 0{d). We therefore implemented a specialized 
propagator to channel between Xi and M;„ efficiently. To be more precise, we remove 
the 0{n(f) variables Auu and replace them with 0{nd) Boolean variables Zij. We 
then add the following constraints 

Z,j = l ^^X,<j l<j<d (9) 

Zm_i) = 1 V Z,„ = V Miu >0 1 <l <u<d,l <i <n (10) 

These constraints are enough to channel changes in the bounds of the X variables 
to Miu- There are 0{nd) constraints (9), each of which can be propagated in time 0{d) 
over a branch, for a total of 0(nd^ ) . There are O {nd^ ) clausal constraints (10) and each 
of them can be made BC in time 0(1) down a branch of the search tree, for a total cost 
of 0{nd^ ) . Since channeling dominates the asymptotic complexity of the entire decom- 
position of Section 4, this improves the complexity of this decomposition to O(nd^). 
This is similar to the technique used in [5] to improve the asymptotic complexity of the 
decomposition of the All-Different constraint. 

Our third improvement is to enforce stronger pruning by observing that when 
Miu — 0, we can remove the interval [I, u] from all variables, regardless of whether 
this modifies their bounds. This corresponds to enforcing RC on constraints (4). Inter- 
estingly, this is sufficient to achieve RC on the AtMostNValue constraint. Unfortu- 
nately, constraints (10) cannot achieve this pruning and using constraints (4) increases 
the complexity of the decomposition back to 0{nd'^). Instead we extend the decompo- 
sition with 0{d\ogd) Boolean variables i?ii(i+2i=) G [0, 1], 1 < J < n, 1 < / < d, < 
k < [logdj. The following constraint ensures that Bijj = 1 ^=^ Xi = j. 

DomainBitmap(X,, [5,11, . • • , Bm]) (H) 

Clearly we can enforce RC on this constraint in time 0{d) over a branch, and 0{nd) 
for all variables Xi. We can then use the following clausal constraints to channel from 
variables Miu to these variables and on to the X variables. These constraints are posted 
for every 1 < i < n,l < I < u < d,l < j < d and integers k such that < fc < 

LlogdJ: 



'ii(i+2fc + i-l) — 1 V Sy(j+2'=-l) — 








(12) 


',j(j+2fc + i_i) = 1 V -Bj(j+2'=)0+2'= + i-l) 


= 






(13) 


Miu^0VBa(i+2>'-i) =0 




2'' <u- 


-1 + K 2^+^ 


(14) 


Mi„ ^ V S,(„_2'^- + l)n = 




2^ <u- 


-l + l< 2*^+1 


(15) 



The variable -Bi;((+2fc--i), similarly to the variables Aiu, is true when Xi G [1,1 + 
2*^ — 1], but instead of having one such variable for every interval, we only have them for 
intervals whose length is a power of two. When Miu = 0, with 2*^ < u — l + 1 < 2*^+^, 
the constraints (14)-(15) set to the B variables that correspond to the two intervals of 
length 2*^ that start at I and finish at u, respectively. In turn, the constraints (12)-(13) 
set to the B variables that correspond to intervals of length 2'^"^, all the way down to 
intervals of size 1. These trigger the constraints (11), so all values in the interval [I, u] 
are removed from the domains of all variables. 

Example. Suppose Xi e [5, 9]. Then, by (9), Zu = 0, Zig = 1 and by (10), M59 > 0. 
Conversely, suppose M59 — and Xi G [1, 10]. Then, by (14)-(15), we get -B158 = 
and BiQg = 0. From Bi^s — and (12)-(13) we get -B156 = 0, Bng = 0, -B155 = 
B166 = -B177 = -B188 = 0, and by (11), the interval [5, 8] is pruned from Xi. Similarly, 
Bieg = causes the interval [6, 9] to be removed from Xi, so Xi G [1, 4] U {10}. 

Note that RC can be enforced on each of these constraints in constant time over a 
branch. There exist 0{nd\ogd) of the constraints (12)-(13) and 0{ncf) of the con- 
straints (14)-(15), so the total time to propagate them all down a branch is 0{nd^). 

6 AtLeastNValue constraint 

There is a similar decomposition for the AtLeastNValue constraint. We introduce 
0/1 variables, Auu to represent whether Xi uses a value in the interval [l,u], and integer 
variables, Eiu with domains [0, n] to count the number of times values in [/, u] are re- 
used, that is, how much the number of variables taking values in [l,u] exceeds the 
number u — / + 1 of values in [l,u]. To constrain these introduced variables, we post the 
following constraints: 

Auu == 1 <=> X,e[l,u\ y l<i <n,l<l <u<d (16) 

Eiu > Er=i ^au -{u-l + 1) Vl<l<u<d (17) 

Eiu = Elk + E(k+i)u yi<k<u<d (18) 

N<n-Eid (19) 

Running Example. Consider the decomposition of an AtLeastNValue constraint 
over the following variables and values: 
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X, 
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X2 
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x^ 


* * * 


Xi 
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X5 


* * 


N 
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Bound consistency reasoning on the decomposition will make the following inferences. 
As dom{Xi) C [2,4] for i G 2. .5, from (16) we get Ai2i — \ for i G 2. .5. Hence, by 
(17), S24 > 1. By (18), Si5 = ^14 + £^55, £^14 = £11 +£24- Since £24 > 1 we deduce 
that £15 > 1. Finally, from (19) and the fact that n ~ b, we get N < A. This gives us 
bound consistent domains for the AtLeastNValue constraint. 



We now prove that this decomposition does not hinder propagation in general. 

Theorem 4 BC on the constraints (16) to (19) is equivalent to BC on 
AtLeastNValue {[Xi, . . . ,Xn],N), and takes 0{nd?') time to enforce down the 
branch of the search tree. 

Proof: First note that changing the domains of the X variables cannot affect the lower 
bound of N by the AtLeastNValue constraint and, conversely, changing the upper 
bound of N cannot affect the domains of the X variables. 

It is known [12] that card^{X) is equal to the size of a maximum matching Ad in 
the value graph of the constraint. Since N <n~ Eid, we show that the lower bound of 
Eld is equal to n — \M\.^ We first show that we can construct a matching M{E) of size 
n — min{Eid), then show that it is a maximum matching. The proof uses a partition of 
the interval [1, d] into a set of maximal saturated intervals / = {[bj, Cj]}, j = 1, . . . ,k 
such that min{Eb.^c) = X]r=i "^*"(^i6 c ) — [cj — bj + 1) and a set of unsaturated 
intervals { [bj , Cj] }such that min{Ei, . ^c ) =0. 

Let / = {[^jj Cj] I j e [1 . . . fc]} be the ordered set of maximal intervals such 
that min{Ebj^cj) — X]"=i ^TT'in{Aii,-cj) — {cj — bj + 1). Note that the intervals in 
I are disjoint otherwise intervals are not maximal. An interval [bi, Ci] is smaller than 
[bj,Cj] iff Ci < bj. We denote the union of the first j intervals D-'j = [Jl^i[bi,Ci], 
j = [1, . . . ,k], p — \Dj\ and the variables whose domain is inside one of intervals / 
Xi = {Xp^\doTn{Xp^) C Df}. 

Our construction of a matching uses two sets of variables, Xj and X \ Xj. First, we 
identify the cardinality of these two sets. Namely, we show that the size of the set Xj is 
p + min{Ei^d) and the size of the set X \ Xj is n — (p + min{Ei,d)). 

Intervals / are saturated therefore each value from these intervals are taken by a 
variable in Xi. Therefore, Xi has size at least p. Moreover, there exist min{Eid) 
additional variables that take values from D^, because values from intervals be- 
tween two consecutive intervals in / do not contribute to the lower bound of the 
variable E by construction of /. Therefore, the number of variables in D^ is at 
least p + min{Ei,d). Note that constraints (18) imply that Eid equals the sum of 
variables Ei^^^^i + E^^^ct + ^ci+1,62-1 • • • + Eb^,Ck + -^c^+i^d- As intervals in / 
are disjoint then J2i=i^^i^bi,ci) — \Xi\ — p. If \Xi\ > p + min{Ei^d) then 
X]i=i "^^"•(-^bi.ci) > ^n{Eid) and the lower bound of the variable Eid will be in- 
creased. Hence, \Xj\ = p + min{Eid). 

Since all these intervals are saturated, we can construct a matching Mj of size p 
using the variables in Xj. The size of X \ Xj is n — p — ■min{Eid). We show by 
contradiction that we can construct a matching M^_^k of size n — p — min{Eid) 
using the variables in X \ Xj and the values D — D'j. 

Suppose such a matching does not exist. Then, there exists an interval [6, c] such 
that \{D \ Dj) n [b,c]\ < X^iexxx "^*"(^ibc), i-C-, after consuming the values in 
I with variables in Xj, we are left with fewer values in [b, c] than variables whose 
domain is contained in [b, c]. We denote p' = \ [b, c] D D'l\, so that p' is the number of 
values inside the interval [6, c] that are taken by variables in Xj. The total number of 



^ We assume that Eid is not pruned by other constraints. 



variables inside the interval [b, c] is greater than or equal to X]r=i '^^"■(^itc)- The total 
number of variables Xj inside the interval [b, c] equals to p' + min{Ei,,c)- Therefore, 
J2i£X\Xi "^^"•(^i&c) < X]"=i rnin{Aibc) — p' — min{Eb^c)- On the other hand, the 
number of values that are not taken by the variables Xj in the interval [b, c] is c — 
b + 1 — p'. Therefore, we obtain the inequality c — 6 + 1— p' < X^ILi '^^"■(^ibc) — 
p' — min{Eb^c) or min{Ei,c) < J27=i m'^n{Aibc) — (c — 6 + 1). By construction of 
/, X]r=i ^'iT-i^ibc) — (c — fe + 1) < min{Ebc), otherwise the intervals in / that are 
subsets of [&, c] are not maximal. This leads to a contradiction, so we can construct a 
matching M{E) of size n — min{Eid)- 

Now suppose that M{E) is not a maximum matching. This means that min{Eid) 
is overestimated by propagation on (16) and (19). Since M{E) is not a maximum 
matching, there exists an augmenting path of M{E), that produces M' , such that 
\M'\ — \M{E)\ + 1. This new matching covers all the values that M{E) covers and 
one additional value q. We show that q cannot belong to the interval [1, d]. 

The value q cannot be in any interval in /, because all values in [6^, q] e / are used 
by variables whose domain is contained in [6^, c^]. In addition, q cannot be in an interval 
[&, c] between two consecutive intervals in /, because those intervals do not contribute 
to the lower bound of Eid- Thus, M' cannot cover more values than M{E) and they 
must have the same size, a contradiction. 

We show that when N is BC and dom{N) ^ {card^{X)}, all X variables are BC. 
Take any assignment S <E nXiexf'ange{Xi) such that card{S) ~ card^{X). Let 
S[Xi 4- b] be the assignment S where the value of Xi in S has been replaced by b, one 
of the bounds of Xi. We know that card{S[Xi ^ b]) e [card{S) - 1, card{S) + 1] = 
[card^{X) — 1, card^{X) + 1] because only one variable has been flipped. Hence, any 
assignment (S*, p) with p < cardi^{X) — 1 is a bound support. dom{N) necessarily 
contains such a value p by assumption. 

We now show that if A^ = card^{X), enforcing BC on the constraints (16)-(19) 
makes the variables X BC with respect to the AtLeastNValue constraint. We first 
observe that in a bound support, variables X must take the maximum number of dif- 
ferent values because N =- card^{X). Hence, in a bound support, variables X that are 
not included in a saturated interval will take values outside any saturated interval they 
overlap and they all take different values. We recall that ■min{Eid) = n — \M\ = 
n — cardf{X). Hence, by constraint (19), Eid — n — N . We recall the the size 
of set Xi equals p + Eid- Constraints (18) imply that Eid equals the sum of vari- 
ables £'i,6i_i +£;bi,ci + Sci+l,f)2-l ■■■+Eb^,Ck + Eck + lA^^AY.i=l^^^{Eb,,c,) = 

\Xi\ — p = min{Eid) — max{Eid)- Hence, by constraints (18), the upper bounds of 
all variables Eb^.a that correspond to the saturated intervals are forced to min{Eb^^ci)- 
Thus, by constraints (16) and (17), all variables in X \ Xj have their bounds pruned 
if they belong to Dj. By constraints (18) again, the upper bounds of all variables Eiu 
that correspond to the unsaturated intervals are forced to take value 0, and all variables 
Ei'u' with [I' , u'] C [I, u] are forced to as well. Thus, by constraints (16) and (17), all 
variables in X \ Xj have their bounds pruned if they belong to a Hall interval of other 
variables in X \ Xj. This is what BC on the All-Different constraint does [5]. 

There are 0{nd^) constraints (16) that can be woken 0{d) times down the branch 
of the search tree in 0(1), so a total of 0{nd^) down the branch. There are 0{d^) 



constraints (17) which can be propagated in time 0{n) down the branch for a 0{ncP). 
There are 0{cf) constraints (18) which can be woken 0{n) times each down the branch 
for a total cost in 0{n) time down the branch. Thus a total of O(ncP). The final com- 
plexity down the branch of the search tree is therefore 0{nd^). D 

The complexity of enforcing BC on AtLeastN Value can be improved to 0{n(P) 
in a way similar to that described in Section 5 and in [5]. As with AtMostNValue, 
enforcing RC on constraints (16) enforces RC on AtLeastNValue, but in this case 
we cannot reduce the complexity below 0{nd?'). Similarly to AtMostN Value, the- 
orem 4 shows that the bound consistency propagator of AtLeastNValue is not algo- 
rithmically global with respect to time and provides evidence that it is algorithmically 
global with respect to space. 

7 Experimental results 

As noted before, the main value of these decompositions is theoretical: demonstrating 
that the bound consistency propagator of [12] for the N Value constraint can be sim- 
ulated using a simple decomposition with comparable time complexity over a branch 
of the search tree but greater space complexity. To see when this space complexity hits, 
we performed some experiments. We used a benchmark problem, the dominating set of 
the Queen's graph used in previous studies of N Value [13] and ran experiments with 
Hog Solver 6.2 and Dog CPLEX 9. 1 on an Intel Xeon 4 CPU, 2.0 Ghz, 4Gb RAM. The 
dominating set of the Queen's graph problem is to put the minimum number of queens 
on a n X n chessboard, so that each square either contains a queen or is attacked by one. 
This is equivalent to the dominating set problem of the Queen's graph. Each vertex in 
the Queen's graph corresponds to a square of the chessboard and there exists an edge 
between two vertices iff a queen from one square can attack a queen from the other 
square. To model the problem, we use a variable Xi for each square, and values from 
1 to n^ and post a single AtMostNValue([Xi, . . . , Xn2],N) constraint. The value 
j belongs to dom{Xi) iff there exists an edge {i, j) in the Queen's graph or j = i. For 
n < 120, all minimum dominating sets for the Queen's problem are either of size \n/2] 
or \n/2 + 1] [15]. We therefore only solved instances for these two values of N. 

We compare our decomposition with the simple decomposition of the 
AtMostNValue constraint in Hog Solver and Hog CPLEX solvers. The simple de- 
composition is the one described in Section 3.1 except that in constraint (3), we replace 
"=" by "<". We denote this decomposition Occs and Occs'-'^^^^ in Hog Solver and 
CPLEX, respectively. To encode this decomposition into an integer linear program, we 
introduce literals bij, i,j G [1, n^] and use a direct encoding with bij for the truth of 
Xi — j and channeling inequalities 1 — bij + Bj > 1, i,j £ [l,n'^]. We use the di- 
rect encoding of variables domains to avoid using logic constraints, like disjunction and 
implication constraints in CPLEX. The default transformation of logic constraints in 
CPLEX appears to generate large ILP models and this slows down the search. 

The BC decomposition is described in Section 4, which we call PyramidBC and 
Pyramidg^^^-^ in Hog Solver and CPLEX, respectively. In Hog Solver, as explained 
in Section 5, we channel the variables Xi directly to the pyramid variables Miu to 
avoid introducing many auxiliary variables Anu and we add the redundant constraint 



12i=i ^ii — ^i.n^ to the decomposition to speed up the propagation across the pyra- 
mid. We re-implemented the ternary sum constraint in Hog for a 30% speedup. 

To encode the BC decomposition into an integer linear program, we use the linear 
encoding of variables domains [16]. We introduce literals c^ for the truth of Xi < j, 
and the channeling inequalities of the form Ci((_i) + 1 — Ci„ + Af;„ > 1. We again 

2 

add the redundant constraint X^ILi ^a ~ ^i.n^- Finally, we post constraints (6) as 
lazy constraints in CLPEX. Lazy constraints are constraints that are not expected to 
be violated when they are omitted. These constraints are not taken into account in the 
relaxation of the problem and are only included when they violate an integral solution. 



Table 1. Backtracks and rumtime (in seconds) to solve the dominating set problem for the 
Queen's graph. 



n 


iV 


Occs 


PyramidBC 


Occs'^^^'''' 


Pyramid^^^'^^ 




backtracks time 


backtracks time 


backtracks time 


backtracks time 


5 
6 

7 
8 


3 
3 

4 
5 


34 0.01 

540 0.16 

195,212 84.50 

390,717 255.64 


7 0.00 

118 0.03 

83,731 15.49 

256,582 58.42 


1 0.05 

2 0.16 
130,010 1802.49 

24,588 585.07 


3 0.4 

183 9.6 

63 15.8 

30 41.28 



Results of our experiments are presented in Table 1 . Our BC decomposition per- 
forms better than the Occs decomposition, both in runtime and in number of back- 
tracks needed by Hog Solver or CPLEX. CPLEX is slower per node than Hog Solver. 
However, CPLEX usually requires fewer backtracks compared to ILOG Solver. In- 
terestingly CPLEX performs well with the BC decomposition. The time to explore 
each node is large, reflecting the size of decomposition, but the number of search 
nodes explored is small. We conjecture that integer linear programming methods 
like CPLEX will perform in a similar way with other decompositions of global con- 
straints which do not hinder propagation (e.g. the decompositions we have proposed for 
All-Different and GCC). FinaUy, the best results here are comparable with those 
for the AtMostN Value bounds consistency propagator in [13]. 

8 Other related work 



Bessiere et al. consider a number of different methods to compute a lower bound on the 
number of values used by a set of variables [13]. One method is based on a simple linear 
relaxation of the minimum hitting set problem. This gives a propagation algorithm that 
achieves a level of consistency strictly stronger than bound consistency on the N Value 
constraint. Cheaper approximations are also proposed based on greedy heuristics and 
an approximation for the independence number of the interval graph due to Turan. De- 
compositions have been given for a number of other global constraints. For example, 
Beldiceanu et al. identify conditions under which global constraints specified as au- 
tomata can be decomposed into signature and transition constraints without hindering 



propagation [17]. As a second example, many global constraints can be decomposed 
using Roots and Range which can themselves be propagated effectively using sim- 
ple decompositions [18]. As a third example, the REGULAR and Cfg constraints can be 
decomposed without hindering propagation [19, 20]. As a fourth example, decomposi- 
tions of the Sequence constraint have been shown to be effective [21]. Most recently, 
we demonstrated that the All-Different and GCC constraint can be decomposed 
into simple primitive constraints without hindering bound consistency propagation [5]. 
These decompositions also introduced variables to count variables using values in an 
interval. For example, the decomposition of All-Different ensures that no interval 
has more variables taking values in the interval than the number of values in the inter- 
val. Using a circuit complexity lower bound, we also proved that there is no polynomial 
sized SAT decomposition of the All-Different constraint (and therefore of its gen- 
eralizations like N Value) on which unit propagation achieves domain consistency [6]. 
Our use of "pyramid" variables is similar to the use of the "partial sums" variables in the 
encoding of the SEQUENCE constraint in [21]. This is related to the cumulative sums 
computed in [22] . 

9 Conclusions 

We have studied a number of decompositions of the NValue constraint. We have 
shown that a simple decomposition can simulate the bound consistency propagator for 
NValue [12] with comparable time complexity but with a much greater space com- 
plexity. This supports the conclusion that the benefit of a global propagator may of- 
ten not be in saving time but in saving space. Our other theoretical contribution is to 
show the first range consistency algorithm for NValue, that runs in 0{nd!^) time and 
0{n(P) space. These results are largely interesting from a theoretical perspective. They 
help us understand the globality of global constraints. They highlight that saving space 
may be one of the important advantages provided by propagators for global constraints. 
We have seen that the space complexity of decompositions of many propagators equals 
the worst case time complexity (e.g. for the All-Different, GCC, Among, Lex, 
Regular, Cfg and Sequence constraints). For global constraints like Regular, 
the space complexity of the decompositions does not appear to be that problematic. 
However, for global constraints like NVALUE, the space complexity of the decompo- 
sitions is onerous. This space complexity seems hard to avoid. For example, consider 
encodings into satisfiability and unit propagation as our inference method. As unit prop- 
agation is linear in time in the size of the encoding, it is somewhat inevitable that the 
size of any encoding is the same as the worst-case time complexity of any propagator 
that is being simulated. One other benefit of these decompositions is that they help us 
explore the interface between constraint and integer linear programming. For exam- 
ple, we saw that an integer programming solver performed relatively well with these 
decompositions. 
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